Joint Audio-Video Fingerprint Media Retrieval Using Rate-Coverage Optimization
Abstract
In this work, we propose a joint audio-video fingerprint automatic content recognition (ACR) technology for media retrieval. The problem addressed is how to balance query accuracy against fingerprint size, and how to allocate the fingerprint bits between video frames and audio frames to achieve the best query accuracy. By introducing a novel concept called Coverage, which is highly correlated with query accuracy, we form a rate-coverage model that translates the original problem into an optimization problem solvable by dynamic programming. To the best of our knowledge, this is the first work that uses joint audio-video fingerprint ACR technology for media retrieval with a theoretical problem formulation. Experimental results indicate that, compared with reference algorithms, the proposed method achieves up to a 25% improvement in query accuracy at 60% of the overall bit-rate, and a 25% bit-rate reduction at 85% accuracy. It also significantly outperforms solutions that use a fingerprint from a single audio or video source.
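The abstract does not define Coverage or give the rate-coverage model, so the following is only a generic sketch of budgeted bit allocation by dynamic programming (a multiple-choice knapsack), not the authors' formulation. It assumes, purely for illustration, that each stream (for example a video frame or an audio frame) offers a few fingerprint options with a known bit cost and coverage value, and that coverage is additive across streams. The function name `allocate_bits` and all numbers are hypothetical.

```python
from typing import List, Tuple

# One option = (bit cost, coverage value). All values used with this sketch
# are hypothetical; the paper's actual Coverage measure is not given here.
Option = Tuple[int, float]


def allocate_bits(streams: List[List[Option]], budget: int) -> Tuple[float, List[int]]:
    """Choose one option per stream to maximize summed coverage within `budget` bits.

    Returns (best total coverage, index of the chosen option for each stream).
    Assumes additive coverage, which is an illustrative simplification.
    """
    neg_inf = float("-inf")
    # best[b] = highest coverage reachable using exactly b bits so far
    best = [neg_inf] * (budget + 1)
    best[0] = 0.0
    picks = []  # picks[s][b] = (option index, bits used before stream s)

    for options in streams:
        new_best = [neg_inf] * (budget + 1)
        pick = [None] * (budget + 1)
        for used in range(budget + 1):
            if best[used] == neg_inf:
                continue  # this bit count is not reachable
            for idx, (bits, coverage) in enumerate(options):
                total = used + bits
                if total <= budget and best[used] + coverage > new_best[total]:
                    new_best[total] = best[used] + coverage
                    pick[total] = (idx, used)
        best = new_best
        picks.append(pick)

    end = max(range(budget + 1), key=lambda b: best[b])
    if best[end] == neg_inf:
        raise ValueError("budget too small for any allocation")

    # Walk back through the stored picks to recover the chosen options.
    chosen = []
    used = end
    for pick in reversed(picks):
        idx, prev = pick[used]
        chosen.append(idx)
        used = prev
    chosen.reverse()
    return best[end], chosen


if __name__ == "__main__":
    video = [(0, 0.0), (2, 0.5), (4, 0.7)]  # hypothetical (bits, coverage) pairs
    audio = [(0, 0.0), (1, 0.3), (3, 0.6)]
    print(allocate_bits([video, audio], budget=5))
```

With these toy inputs the search prefers a medium video fingerprint plus the largest audio fingerprint over spending most of the budget on video alone, which is the kind of audio-video trade-off the abstract describes.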
Related resources
Different Indexing Techniques
This paper describes content-based indexing techniques: audio indexing, video indexing, content-based image indexing, and content-based multimedia indexing. Indexing is concerned with compactly storing a large collection of terms and rapidly retrieving from it a set of candidate terms that satisfy some property. An index is a structure or object in the database Ind...
A Robust Audio Fingerprinting Algorithm in MP3 Compressed Domain
In this paper, a new robust audio fingerprinting algorithm in MP3 compressed domain is proposed with high robustness to time scale modification (TSM). Instead of simply employing short-term information of the MP3 stream, the new algorithm extracts the long-term features in MP3 compressed domain by using the modulation frequency analysis. Our experiment has demonstrated that the proposed method ...
ITU MSPR TRECVID 2010 Video Copy Detection System
In this paper we describe the system designed by the ITU MSPR Group for content-based video fingerprinting as applied to the TRECVID 2010 Content Based Copy Detection (CBCD) benchmark. This year the focus of the system was on integrating audio and video fingerprinting to improve robustness to attacks. The proposed system consists of three main modules: audio/video fingerprint extraction, aud...
Istanbul Technical University at TRECVID2008
Assessing Semantic Relevance by Using Audiovisual Cues
This paper presents two complementary approaches for assessing semantic relevance in video retrieval—(1) adaptive video indexing and (2) elemental concept indexing. Both approaches make extensive use of audiovisual cues. In the former, retrieval is performed by using implicit semantic indices through audio and visual features. Audio features are extracted by statistical time-frequency analysis ...
Journal: CoRR
Volume: abs/1609.01331
Publication date: 2016